Answer Retrieval From Extracted Tables

نویسندگان

  • Xing Wei
  • Bruce Croft
  • David Pinto
چکیده

Question answering (QA) on table data, which contains densely packed information in two-dimensional form, is a challenging information retrieval task. Data can be placed at a distance from the metadata describing it. The metadata itself can be difficult to identify given the layout of a particular table. This paper describes a QA system for tables created with both machine learning and heuristic table extraction methods. Our approach creates a cell document for each table cell. A probabilistic language model selects the most likely cell documents for the information need. The performance of the system is tested with government statistical data, and errors are analyzed in order to improve the system. We also apply these improvements on another type of table data set and show the experimental results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Machine Translation for Query Expansion in Answer Retrieval

We present an approach to query expansion in answer retrieval that uses Statistical Machine Translation (SMT) techniques to bridge the lexical gap between questions and answers. SMT-based query expansion is done by i) using a full-sentence paraphraser to introduce synonyms in context of the entire query, and ii) by translating query terms into answer terms using a full-sentence SMT model traine...

متن کامل

Improving Text Retrieval Precision and Answer Accuracy in Question Answering Systems

Question Answering (QA) systems are often built modularly, with a text retrieval component feeding forward into an answer extraction component. Conventional wisdom suggests that, the higher the quality of the retrieval results used as input to the answer extraction module, the better the extracted answers, and hence system accuracy, will be. This turns out to be a poor assumption, because text ...

متن کامل

Coling 2008 22 nd International Conference on Computational Linguistics

Question Answering (QA) systems are often built modularly, with a text retrieval component feeding forward into an answer extraction component. Conventional wisdom suggests that, the higher the quality of the retrieval results used as input to the answer extraction module, the better the extracted answers, and hence system accuracy, will be. This turns out to be a poor assumption, because text ...

متن کامل

Answer Passage Retrieval for Question Answering

Document or passage retrieval is typically used as the first step in current question answering systems. The accuracy of the answer that is extracted from the passages and the efficiency of the question answering process will depend to some extent on the quality of this initial ranking. We show how language model approaches can be used to improve answer passage ranking. In particular, we show h...

متن کامل

Attribute Retrieval from Relational Web Tables

In this paper, we propose an attribute retrieval approach which extracts and ranks attributes from HTML tables. Given an instance (e.g. Tower of Pisa), we want to retrieve from the Web its attributes (e.g. height, architect). Our approach uses HTML tables which are probably the largest source for attribute retrieval. Three recall oriented filters are applied over tables to check the following t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004